PERTURBATIONS OF JORDAN MATRICES 



E B DAVIES AND MILDRED HAGER 



Abstract. We consider perturbations of a large Jordan matrix, either random 
and small in norm or of small rank. In both cases we show that most of the 
eigenvalues of the perturbed matrix are very close to a circle with centre at 
the origin. In the case of random perturbations we obtain an estimate of the 
number of eigenvalues that are well inside the circle in a certain asymptotic 
regime. In the case of finite rank perturbations we completely determine the 
spectral asymptotics as the size of the matrix increases. The paper provides an 
elementary illustration of some standard techniques of spectral theory. 



1. Introduction 

It is well known that the eigenvalues of large non-normal matrices can be highly 
unstable under very small perturbations. In this note we discuss a very simple 
example of this phenomenon. We show that in a wide variety of cases almost all of 
the eigenvalues of a slightly perturbed Jordan matrix lie near a circle with centre 
at the origin, with high probability in the random case. We also examine the 
exceptional eigenvalues, which remain well inside the circle. 

A quantitative measure of spectral instability is provided by the notion of pseu- 
dospectra, which become interesting when the operator involved is far from being 
normal; see pHl E] for detailed discussions and many references. If 5 > the 
5-pseudospectra of an operator A are defined by 

Spec 5 (A) = Spec(A) U {z £ Spec(A) : \\{A - z y 1 \\ > cT 1 } 

(J Spec{A + 5K), W 

{iC:||if |j<l} 

where Spec denotes the spectrum of a matrix. The second equality in ([1]) im- 
plies that a perturbation of A of size 5 can move the eigenvalues anywhere inside 
Spec^A). In particular the computed eigenvalues of a large matrix may be very 
inaccurate if Spec^A) is a large region, where 5 is the rounding error of the com- 
putations. In this note, we study this phenomenon in some detail for the Jordan 
block matrix, perturbed either by a matrix of small rank, in which case the analysis 
is much sharper, or by a random matrix with a small norm. The problem studied 
in this paper was proposed by Zworski, who showed how the general methods of 
Sjostrand and Zworski ([9]) could be adapted to this particular setting. Our results 
go beyond the theory of Lidskii ([HE]) by allowing larger (but still extremely small) 
perturbations, for which the Puiseux series is not convergent and the eigenvalues 
are not where the first few terms of that series would predict. 
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Figure 1. Eigenvalues of a small random perturbation of the Jor- 
dan matrix J, with N = 100 and 5 = 10~ 10 . 

We define the standard N x N Jordan matrix J by 




1 if s — r + 1, 
otherwise, 



where r, s = 1, N, and we always assume N > 2 from now on. Figure 1 shows 
the results of a MATLAB computation of Spec(J + SK), when N = 100, 5 = 10~ 10 
and K is a complex gaussian random matrix. Our goal is to explain the form of 
the figure and others obtained by similar methods. 

Most of the eigenvalues accumulate around a circle with centre at the origin, hence 
far away from Spec(J) = {0}, even though the perturbation is very small in norm 
(with a high probability). In Section [21 we explain the origin of this instability and 
prove that it is very likely to happen for this type of perturbation. The section 
illustrates the methods and ideas of [3], [I] in a very concrete setting. 

If one adds a strictly upper triangular matrix to J then the spectrum is not changed. 
In Section [3j we therefore concentrate on perturbations whose non-zero entries are 
all close to the bottom left-hand corner of the matrix; some generalizations are 
considered in Section HI We give a complete asymptotic analysis of the spectrum 
for all such perturbations. The problem is described in detail in the following 
section. The equation to be solved is written down in Theorem [TTJ The asymptotic 
form of the solutions to this equation is described in Theorem [T21 that will also be 
related to a Grushin problem as in [Sj, see also Section [21 and then in much more 
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detail in Theorem [T3l Section 14.21 describes the asymptotics of the zeros of some 
more general polynomials. 



2. Random perturbations 

Let us start by investigating the pseudospectra of the Jordan matrix. Let D(w,r) = 
{z G C : \z — w\ < r}. 

Lemma 1. IfO < \z\ < 1, then 

\z\~ N < IKJ-z)- 1 ]] < N\z\~ N , (2) 

which implies that for 5 < 1/N, 

D(0, 5 1/JV ) C Spec 5 ( J) C D(0, {5Ny/ N ) . (3) 

Proof. Let e(z) = (1, ... } z N ~ 1 y. If < \z\ < 1, the identity 

\\(J-z)e(z)\\ = \z\ N <\z\ N \\e(z)\\ 

implies the lower bound in (T5]). We have the following expression for the resolvent 
for z 7^ 0: 

-z'-*- 1 if 1 < r < s < N, 
otherwise. 



(J - *Ki - 

We conclude that for \z\ < 1, 



N 

\\{J ~ zr'uW 2 < (E \A- ] ) 2 \\u\\ 2 < {N\z\- N f\\uf . (4) 
j'=i 

□ 

Using we deduce that the eigenvalues of a perturbation of norm 5 may be 
anywhere in D(Q, (5N) 1/N ). In Fi gure 1 most, but not all, of the eigenvalues are 
close to the boundary of this disc, and our goal is to understand why this is the 
case. 

Theorem 2. Let K be a N x N random matrix such that 

P[\\K\\ < 1] > l- Pl {N) , (5) 

and 

P[\K NA \<s]<p 2 (s;N) . (6) 
Then for any 0<5<j^,a>5, and a > 0, with probability at least 

l- Pl (N)-p 2 (ba;N) , (7) 

we have 

Spec( J + 5K) C D(0, (SN) 1/N ) , (8) 

and 

#(Spec(J + 5K) n D(0, (5N) 1 / N e' (7 )) < -(InN - In a) . (9) 
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The following theorem is obtained by putting a = N~ 3 and estimating pi(N) and 
P3(5a, N). We see that for any fixed a > 0, the proportion of eigenvalues that lie 
in the annulus 

{z : {5Nff N e- a < \z\ < (SN) 1 ^} 
converges to one with probability one as N — > oo. 

Theorem 3. Let K be a N x N random matrix with its entries independently 
and identically distributed according to a complex gaussian law centered at and 
of variance 1. Let K = K/N 2 . Then for any < 5 < N~ 3 and a > 0, with 
probability at least 1 — 26 /N 2 , is valid and 

#(Spec( J + 8K) D D{0, (5A0 1/A V CT )) < - In iV . (10) 
Choosing 5 = 7 , we also obtain the following result. 

Corollary 4. Let K be as in Theorem^ Then for any < 7 < N~ 3 / N and a > 0, 
with probability at least 1 — 2QN 2 , 

Spec( J + j N K) C D(0, jN l / N ) (11) 

and 

#(Spec(J + 7 Af K) n£)(0,7iV 1/Jv e- <7 )) < ^lniV . (12) 

2.1. The Grushin problem. For the proof of Theorem [21 we set up a Grushin 
problem as in P Sect. 2.2]. Let A e M N (C), let m G N, and let i?+ and be 
m x N matrices. We put 

A ={r + R Q~ ) G M*+m(C) . (13) 

Lemma 5. (Schur, Grushin) If A is invertible, and 

8 = 

is the matrix inverse of A, then A is invertible if and only if det(E j_) 7^ 0. 

Proof. If A is invertible and det(E_ + ) 7^ 0, then 

A(E - E + EZ\E^) = 1 , (14) 
hence A is invertible. The converse affirmation goes along the same path. □ 
Corollary 6. If 

m = 1 , R_ = 6 7v , i? + = e' x , (15) 

where ex, e2, ejv is the standard basis of column vectors in C N , then A is invert- 
ible if and only if det(A) 7^ 0, where A is obtained by deleting the first column and 
last row of A. In that case 

E_ + = (-if^B. 
K ! det(A) 



E E+ 

E— 4- 



Example 7. If A = J — zl and R± are as in pojl . we write J = A. By the 
previous corollary J is invertible, and 

z T ~ s ~ x ifs + l<r<N + l, 
€,.„ ■ { z r ~ l if l<r<iV + lands = iV + l, 
otherwise. 



Assuming \z\ < 1 we deduce that 

" V t (.:)|| < ,\ \ £_+(.:) = - 



£(z)\\<N + l, \\E(z)\\ < N, \\E + (z)\\ <N l '\ E_ + (z" A 



S(0)\\<1, \\E(0)\\ <1, p?±(0)||<l, £_ + (0) = 0. (16) 



Finally, using ( tHj) , we can also find the explicit expression for the resolvent. 

2.2. Perturbation. Let us assume that ||if|| < 1. Then by © holds. We 
analyze the part of the spectrum within D(Q, (£iV) 1//Ar ) in more detail by using the 
Grushin problem for J + 5K. We will show that the matrix J 5 = A obtained by 
putting A = J + 5K — zl in (fT3l) may be inverted by using a Neumann series. 
Denoting the inverse by £ 5 , this implies that Spec(J + 5K) fl D(0,R) coincides 
with the set of zeros of E s _ + (z) such that \z\ < R. 

Lemma 8. Let \\K\\ < 1, and let 5 < 1/2N. Then ifN>2, for any R<1, 

Spec( J + 5K) n D(0, R) = {z e D(0, R); E 5 _ + (z) = 0} , (17) 

where 

Et + (z)=z N - Sp K (z) + q SK (z) (18) 



-+ 

and 



N-l 



p K (z) = £ K (N „ r)>s+1 z r+s . (19) 

r,s=0 

If\z\ < (5iV) 1/7V = R, we have 

||£-+||oc <35N , (20) 

in L°°(D(0,i2)), and 

|£_+(0)|> 5(\K NA \ -25) . (21) 

Proof. The resolvent expansion implies invertibility provided \\SK\\ \\S \\ < 1, which 
is proved by using (|T6l) and 5||iT||A^ < 1/2. Moreover we have the following Neu- 
mann series expansion for the inverse: 

F s_ F o , ( Ej>iE(-5KEY Ej>i(-E5KYE + ) \ 

+ {j2 J >iE.(-5KEy E j >iE-(-SKEy-\-6KE + ) ) ' [ZZ) 



Evaluating the bottom right coefficient of each term yields (|18l) with 
p K (z) = E_KE + , 

q SK ( z ) = 5 2 E_KEKE + - 5 3 E_KEKEKE + + ... . 
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The bounds on the various quantities now follow by combining ( fTBl . \\K\\ < 1, 
5 < 1/2N, and \z\ < (SN) 1 ^ : 

Ibiclloo < \\K\\N<N , 

\\q S K\\cc< 25 2 \\K\\ 2 N 2 <5N, (23) 
||S_+||oo < SN + 5N + 5N < 35N . 
Moreover, using also the second line of (j!6p . 

\p K (0)\ = \K NA \, 

\qsk(0)\ < 25 2 \\K\\ 2 < 25 2 , (24) 
\E 5 _ + (0)\>5(\K NA \-25) . 

□ 

To estimate the number of zeros of the holomorphic function E 5 _ + (z) we will need 
the next proposition. 

2.3. Counting the zeros and proof of Theorem [21 

Proposition 9 (The Poisson- Jensen formula). Let f be a holomorphic function 
that does not vanish anywhere on the boundary of D(0, R), where < R < oo. Let 
M be the number of zeros of f in D(0, Re~ a ) for some positive constant a. Then 

M<if-ln„ JW>I ). (25) 



a 



L°°(D(0,i?)) . 



This is a direct consequence of formula (1.2'), p. 163 in [7j: if / is a holomorphic 
function in D(0,R) with zeros a M there, then 

ln|/(0)| = ^ ln\f(Re i6 )\d9 - E In A < In WfW^ - £ In A. (26) 
Hence 

- In |/(0)i+ In H/lloo > E ln^> £ ln TT^ = Ma, ( 2 7) 

|a M |<R \a„\<Re-° Ue 

which is our proposition. 

Proof of Theorem^ With probability at least 1 —pi(N), we know that \\K\\ < 1, 
which we will assume from now on. Then Lemma M holds. Setting f(z) = E 5 _ + (z) 
and R = (SN) 1 ^ , this implies that ||/||l°°(d(o,.r)) < 35 N. We also assume that / 
does not vanish on \z\ = R, since otherwise we may diminish R slightly and obtain 
our result as a limit over increasing radii fulfilling this assumption. 

If a > 5, with probability at least 1 — p 2 (5a; N), we have |-fGv,i| — 25 > 3a, hence 
again by Lemma El 1/(0)1 > 35a. 

Thus we obtain that with probability at least 1 — pi(N) — P2{5a; N), 

h ' /(0)l <-ln|^<lniV-ln a . (28) 



a 2 } 



(29) 



Lemma \E\ and Proposition [9] imply that with the same probability 

#(Spec(J + SK) n D(0, (5N) 1/N e- a )) < -(lnJV - In a) , 
which finishes the proof of Theorem [21 □ 

2.4. Proof of Theorem [3]. We end this section by showing that a complex gauss- 
ian random perturbation fulfills the assumptions of Theorem |2j 

Proof. Let K be as in Theorem [HJ Let us first show that d^D is fulfilled with 
Pl (N) = 1/N 2 . 

If K is a random matrix with independent complex gaussian normal distributed 
entries, and a > then 

N 

P[\\K\\ > a) < \K jk \ 2 > 
j,k=l 

N 

< E[a- 2 £ \K jh \*\ = N 2 /a 2 . 
j,k=i 

Hence 

Pdl^H < 1] = P[\\K\\ < N 2 } > 1 - N- 2 . (30) 
Next, we have to estimate the following probability: 

P[\K Nll \< 8 ] = P[\K N ,i\ < sN 2 ] 

< ( S N 2 ) 2 =p 2 (s;N). 

Let us choose a = N~ 3 . Then p 2 (5a; ^) = (5/iV) 2 , and Theorem [2] implies that 
with probability at least 1 — 1/N 2 — (5/N) 2 , we have 

#(Spec(J + 5K)nD(0,(5N) 1 / N e- a )) < -lniV , 
which finishes the proof of Theorem [3l □ 

3. Small rank perturbations 

3.1. Description of the Example. Let C be a k x matrix with entries c r s and 
let 5 > be the perturbation parameter. We consider the spectrum of A := J + 5K 
where the N x N matrix K has the block form 

and the (zero) top right hand entry is of size (N — k) x (N — k). 

The asymptotic behaviour of Spec(A) depends on the choice of S. From this point 
onwards we assume that 5 := 7^ for some 7 G (0, 00). If < 7 < 1 then SK is 
a very small perturbation of J, but for 7 > 1 the reverse holds. Surprisingly the 
same analysis applies in both cases; the choice 7 := 1 is not special in any way. 
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-1 -0.8 -0.6 -0.4 -0.2 0.2 0.4 0.6 0.8 1 

Figure 2. Eigenvalues of the Matrix A of Example [TOl 

Our results may have connections with the analysis of paraorthogonal polynomials 
on the unit circle in [8]. 

The spectral behaviour is different for random perturbations of the type considered 
in section [2J The condition number of the diagonalizing matrix is much smaller in 
the random model, and the analysis is harder. For a fixed 7 G (0, 1) the appearance 
of the spectrum is similar, but as 7 — > 1 the number of eigenvalues inside the circle 
increases rapidly, and for 7 > 1 the eigenvalues appear to be randomly distributed. 

Example 10. Computations of the eigenvalues in our model tend to be numerically 
unstable because of the high condition numbers involved. We consider the example 
in which N = 80, 7 = 0.6, k = 3 and 





f 8 


\ 


C : = 


2 


5 




I 1 


-2 3/ 



The eigenvalues of A inside the circle are close to ±i/4, while the radius of the 
circle is close to 7. The condition number of the diagonalizing matrix is 4.5 x 10 17 , 
and increases for smaller 7 or larger N. 

Theorem 11. If 5 := 7^ where 7 G (0, 00) then 

Spec(A) = { 1Z : / = /(*)} 
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provided N is large enough, where 

2k-2 

f(z) := J2 brz r 

r=0 

and 

b r . 7 ^ ^ij- 

k—i+j=r+l 

Proof. The spectrum of A is the set of solutions of <7at(A) = 0, where 

g N (X) :=det(AI- J-8K). (31) 

Let g r {X) be the determinant of the r x r matrix obtained from XI — J + SK by 
deleting its top N — r rows and the leftmost N — r columns. By expanding the 
determinant (13T1) down the leftmost column and assuming that iV > 2(k + 1) we 
obtain 

g N (\) = A#at-i(A) - 5c M - 5c k _ ltl \ - 5c k ^ 2 ,A 2 - ■■■ ~ 5c 1>1 X k ~ 1 . 

The formula for <?at(A) follows inductively, and the proof is completed by making 
the change of variables A := 72:. □ 

Alternative proof using the Grushin problem. Inserting the special form of K in the 
series expansion for E s __, and using N > 2(A;+1), we see that the series only contains 
terms up to first order in 5, so no condition on the smallness of 7 is needed for 
convergence. Putting 5 = 7^ and A = 72; yields 

E 5 _ + (X) = X N - 5E^KE + = 1 N (z N - p K { 1Z )) (32) 

where px was defined in ( TT^j) . We finally observe that p K {jz) = f(z) for all z. 
Although the Grushin problem is equivalent to a direct analysis of the determinant 
in this particular case, it also permits estimates in cases where the determinant is 
quite hard to analyze directly. □ 

3.2. The Equation z N = f(z). Let U be a region in the complex plane that 
contains D(0, 1 + S) for some 5 > 0. Let / be a bounded analytic function defined 
on U. We assume that f(z) = has h distinct solutions Zj satisfying < 1, each 
with multiplicity m,. We put n := YJi=i m i- By reducing 5 > we may assume 
that \zi\ < 1 — S for all %. We will determine the distribution of the solutions of 
z N = f(z) asymptotically as N — ► 00. 

Theorem 12. For every e G (0, 5) there exists N £ such that if N > N e then 
z N = f(z) has rrii solutions in the e -neighbourhood of Zi for each i 6 {1, h}, no 
other solutions in D(0, 1 — e), no solutions in U\D(0, 1 + e) and N — n solutions 
in {z : 1 — e < \z\ < 1 + e}. 

Proof. If iV is large enough then (1 + e) > max{|/(,z)| : z G U}, so the equation 
has no solutions in U\D(0, 1 + e). By applying Rouche's theorem to z N — f(z) 
regarded as a small perturbation of z N , we see that for all large enough N the 
equation has iV solutions inside D(0, 1 + e). A similar argument but regarding 
f(z) asa small perturbation of f(z), implies that the equation has n solutions 
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inside D(0, 1 — e), provided N is large enough, and that these converge to the zeros 
of f(z) as A" — > oo. The remaining N — n solutions must lie in the stated annulus. 

In order to determine the asymptotic behaviour of the N — n solutions in the 
annulus as iV — > oo, we assume for simplicity that f(z) 7^ whenever \z\ = 1. We 
then put 

f(e ls ) := p(s)e^ 

where p(s) is positive and periodic on [0, 2tt] while 0(27i~) = 0(0) + 2nn. Both p(s) 
and 0(s) are real analytic functions of s. It is easy to see that for all large enough 
N the equation 

0( s ) = Ns mod (2tt) 

has N — n solutions in [0, 2n). If these are labelled in increasing order then s r+ ± — 
s r = 2it/N + 0(1/N 2 ). We will show that for large enough iV the solutions of 
z N = f(z) are very close to the points a r := p(s r ) 1 ^ N e lSr . □ 

Theorem 13. Given a G (1, 2) there exists a constant b such that for all large 
enough N and every r£ {0, iV — n — 1} the equation z N = f(z) has a solution 
z r satisfying 

\z r - a r \ < bN~ a 

To leading order the N — n solutions of z N = f(z) that are close to the unit circle 
are uniformly distributed around it. 

Proof. An elementary calculation shows that finding the solution of z N = f(z) 
closest to e lSr is equivalent to finding the solution of z N = f r (z) closest to 1, where 

f r (z) := e- i4,M f(e iSr z). 

We have 

/ r (e") = p^e^) 

where 

<j> r (s) := (j)(s r + s) - 0(s r ), 
p r (s) := p(s r + s). 

Moreover r (O) = and the equation <f> r (s) = Ns is equivalent to 4>(s r + s) = 
N(s r +s). From this point onwards we drop the subscript r, assume that 0(0) = 0, 
and leave the reader to verify that the bounds obtained are uniform with respect 
to r. 

We define the sequence u rn := r m e l6m for m = 1,2,... by U\ := 1 and u m+ \ := 
{f(u m )Y^ N , where we always take the A^th root with the smallest argument. Note 
that 9i = 9 2 = 0, r\ = 1 and r 2 = (/(l)) 1 ^. In the following arguments Cj denote 
positive constants that do not depend on A^ or m provided A^ is large enough. 

We prove that if 

S N ■= {re id : \r - r 2 | < N~ a and \9\ < N~ a } 
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then for all large enough N, u E Sn implies v := {/(u)} 1 ^ E Sn- Put u := re td 
and v := se^. If w E S n then 







|s"-r?| 





l/HI -]/(i)l I 

< Ci\U — 1| 

< a(\u - r 2 \ + \r 2 - 1|) 

< c 2 /iV. 



Therefore 



> rf - c 2 /N = |/(1)| - c 2 /N > c 3 > 
for some c 3 > 0. This also implies that r% > c 3 . Hence 

(7 := £ sV J 2 > Ncf ~ l)/N > iVc 4 . 

i+j=JV-l 

Combining the above estimates yields 

\s-r 2 \< c 2 /Na < c 5 /iV 2 < N~ a 



for all large enough N. 
We next observe that 

Ns N \(j)\ < c 6 \s N sm(N(f))\ = c 6 |Im 



.N N 



r?)\ < c 6 \v N -r 



N „AT| 
2 I 



Therefore 



ce\f(u)-f(l)\ < c 7 \u-l\ < c 8 /N. 
101 < c 8 /c 3 iV 2 < iV" a 



for all large enough N. 

Having established that Sn is invariant under the map u — > {f(u)} l l N provided 
N is large enough, we now apply a contraction mapping argument within Sn- Let 
Zj E S N and put Wj := : = {/(^)} 1/7V for j = 1,2. Then 

\ W 1 ~ W 2\ = \ f( Z l) - f( Z 2)\ < C 9\ Z 1 - Z lY 



Moreover 



E 

i+j=N-l 



w\w J 2 



> ]T Re (w[wi) 

i+j=N-l 

= Si4cos(i0i + j(j) 2 ) 

i+j=N-l 

> Nc 10 

where cio > 0. Therefore 

\w 1 - w 2 \ < c 9 \z l - z 2 \/c 10 N < \z l - z 2 \/2 

provided N is large enough. Since u 2 E Sn, the contraction mapping principle 
now implies that the sequence u m converges as m — > oo to a solution u E Sn of 
u N = f(u), again provided N is large enough. □ 
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Figure 3. Solutions of the polynomial equation of Example [TH 

Note Although we have proved that the eigenvalues of A all lie on or inside the 
unit circle asymptotically, this does not imply that |det(A)| < 1 asymptotically. 
Indeed det(A) = (— l) Ar_1 /(0) may be of any magnitude. If |/(0)| > 1 then the 



implies that p(s) > 1 on average, so the eigenvalues close to the unit circle are 
actually slightly outside it, again on average. 

Example 14. If there exists z such that \z\ — 1 and f(z) = then Theorem [TBI 
needs to be modified. The estimates in the theorem are local, so the conclusions 
are applicable to all the solutions of z — f (z) that lie in 



provided / does not vanish in this set. Figure [3] shows the solutions of z N = 
100(2 - 1) when N = 40. 



4.1. Other finite rank perturbations. In this section we allow the perturbation 
of the Jordan matrix J to have non-zero entries in all corners of the matrix. We 
do not require the perturbation to be small, since we have already indicated that 
this possibility can be accommodated by introducing a scale factor. 



bound 




{z : 1 — 8 < \z\ < 1 + 5 and a < arg(z) < /3} 



4. Some Generalizations 
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Let B, C, D, E be four k x k matrices and put 

A:=J + K (33) 

where N 3> k and K has the block form 

/ B C \ 
K := , 
\D 

the central entry being of size (AT — 2k) x (JV — 2&). A direct calculation shows 
that 

det(A/7v -A) = X N ~ h p(X) - q(X) (34) 

where p, q are polynomials of degree (at most) 2k which depend on B, C, D, E 
but not on N, and the X h coefficient of p is 1. 

The following theorem describes the asymptotic distribution of the eigenvalues of 
A as A^ — > oo. The proof is an obvious adaptation of the proofs of Theorems H2] 
and [T3l 

Theorem 15. Letp, q be two non-zero polynomials, let m be a large enough natural 
number and let < 5 < 1. Suppose that p(z) = and q(z) = have no solutions 
satisfying 1 — 5 < \z\ < 1 + 5. Then the solutions of 

z m p(z) = q(z) 

satisfying \z\ < 1 — 8 converge to the zeros of q in this region as m — > oo. The 
solutions satisfying \z\ > 1 + 5 converge to the zeros of p in this region. The 
solutions satisfying 1 — 5<\z\<l + 5 converge to the unit circle and are given 
asymptotically by Theorem\T^ where f(z) := q(z)/p(z). 

Example 16. We consider the above model with 




The determinant of A equals (—1)^. Its characteristic polynomial is z N 4 p(z) — 
q(z), where 

p( z ) ■= - 4z 3 + 6z 2 + 15z - 33, 
q(z) := -2z 2 -2z-l. 

The zeros of p are all outside the unit circle, at 2.0605±2.3672z, 1.7709 and —1.8920. 
The zeros of q are both inside the unit circle, at — 1/2 ± i/2. The remaining A^ — 6 
zeros of the characteristic polynomial are distributed almost uniformly around the 
unit circle. Figure H] was obtained by putting A^ = 50. 
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Figure 4 . Eigenvalues of the matrix A of Example [16] 
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\D 








4.2. Matrix Pencils. A direct calculation shows that the eigenvalues of the model 
operator f[3"3l are the same as those of the (2k + 1) x (2k + 1) matrix pencil 

A n (z) := P n (z) - J 2k +i ~ Q (35) 

where 

/ zl k 
P n (z) := z n ~ 2k 
\ zl k 

the central entries of both block matrices being of size lxl. By considering other 
finite rank perturbations of the Jordan matrix one is led to investigate the spectral 
asymptotics as n — > oo of the more general m x m matrix pencil 

A n (z) :=B(z)z n + C(z) 

where B(z) and C(z) are analytic matrix- valued functions of z, defined for all z in 
an open region U. In this more general problem, n and A n are not necessarily the 
same as in ( 1351) . 

The eigenvalues of this pencil are, by definition the values of z £ U such that 
det(A n (z)) = 0. One sees that 



rn 



det(A n (z)) = Y,Pr(z)z rn 

r=0 

where pa(z) := det(C(2;)) and p m (z) := det(B(z)). If p m does not vanish on U 
then the zeros of the above expression are the same as those of 

m— 1 

/„(*) := z mn + J2 Qr(z)z rn (36) 

r=0 

where q r := p r /p m are all analytic functions on U. 

14 



Theorem 17. Suppose that the function f n is defined by ( Tffpj) on open region U 
that contains B(l); suppose also that q r are bounded and analytic on U and that 
qo(z) does not vanish if \z\ = 1. Then f n has ran zeros in U for all large enough 
n. They have the following properties for all small enough 5 > 0. There are no 
zeros inUC\{z : \z\ > 1 + 5} provided n is large enough. The zeros of f n satisfying 
\z\ < 1 — S converge as n — > oo to the zeros of q satisfying the same bound. All 
the remaining zeros of f n lie in {z : 1 — S < \z\ < 1 + 6} and they converge to the 
unit circle as n — > oo. 

The proof is essentially the same as that of Theorem [T21 There is also an analogue 
of Theorem [T31 but we deal here only with the case m — 2. In other words we 
consider the asymptotic behaviour of the solutions of an equation of the form 

p n (z) := z 2n + qi (z)z n + q 2 (z) = (37) 

as n — > oo. Examples |2"U1 and |2"T1 illustrate the behaviour that we need to explain. 

The following lemma sets up some notation that will be used in the following 
theorem. 

Lemma 18. Let < 5 < 1/2 and let qi, q 2 be two bounded (uniformly in 5) 
continuous functions on A := {z : 1 — 5 < \z\ < 1 + 5} which are analytic in the 
interior of this annulus. Suppose that neither q 2 nor the discriminant v := q\ — 4q 2 
vanish anywhere in A. Let h, resp. k, be the winding numbers ofv(e 10 ), resp. 
q2{e ie ), around the origin, where 9 G [0,27r]. Then the number of solutions of (3l\ ) 
in A is 2n — k for all large enough n. 

Case 1 If h is even then there exist two non-vanishing analytic functions f± on A 
such that z 6 A is a solution of ( f57| ) if and only if either z n = f+(z) or z n = f-(z). 
Moreover q2(z) = f + (z)f^(z) for all z e A. If we define the real-valued analytic 
functions p± > and <p± on [0, 27r] by 

f±(e w ) :=p ± (6)e^ 
and put 2irk± := 0±(27r) — 0±(O) then k := k + + k_. 

Case 2 If h is odd then there exists a non-vanishing analytic function f on the 
double covering A of A such that z G A is a solution of P^ ) if and only if z n = 
f(zi) or z n = f{z2), where z\ and z 2 are the two points in A above z. Moreover 
Q2(z) = f(zi)f(z 2 ) for all z G A. If we define the real-valued analytic functions 
p > and (f> on [0, 4tt] by 

f(j 9 ) := P {oy m 

then 2rrk = 0(4vr) - 0(0). 

Proof. If \z\ > 1 + 5, then using the uniform boundedness of qi, q 2 we see that for 
n big enough the last two terms in fl3T|) are small compared to \z\ 2n ^> 1, hence p 
cannot vanish there and its 2n zeros are confined to a disc: 

— / ^dz = 2n. 

2-ni J\z\=i+s Pn(z) 

On the other hand, if \z\ — 1 — 5, then for n large enough the first two terms 
in ( 1371) are so small that adding them to q 2 will just have the effect of moving 
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its zeros slightly inside of \z\ < 1 — 5 (recall that these cannot lie on \z\ = 1 — 
5 by assumption). Hence in this case they are equal to the zeros of p n there 
asymptotically: 

— / ^4d,= fc , 

2m J\z\=i-s Pn{z> 

for all large enough n. If 7 is the difference of these contours then the number of 
zeros of p n in A is given by 

1 [ P ' n ^dz, 
2m 7 7 p n {z) ' 

which equals 2n — k. 

To prove the statements in Cases 1 and 2, one only has to observe that in the 
formula 

2f ± {z) = - qi {z)±JvJz) 

the square root has a single-valued branch on A if and only if h is even. The 
formulae for k follows directly from q 2 {z) = f + (z)f-(z) or q 2 {z) = f{z\)f(z2). □ 

Theorem 19. Under the assumptions of Lemma [731 the 2n-k solutions of (3l\ ) in 
A converge to the unit circle. More precisely there exists a constant c > such 
that for all large enough n every zero z e A satisfies 

1 — c/n < \z\ < 1 + c/n. 

Moreover the zeros are asymptotically uniformly distributed around the circle in 
the sense that 

Jim (2U)- 1 #{z : 9 < arg(z) < cf>} = (f> - 9 
provided < 9, <fi < 2tt. 

Proof. Case 1 follows directly from Theorem [T3], while Case 2 involves slight mod- 
ifications of the proof of that theorem. □ 

We conclude with two examples exhibiting the behaviour described in the two 
cases. 

Example 20. Consider the equation 

p n (z) := z 2n + gi (z)z n + q 2 {z) = 0, (38) 

where 

qi(z) := -z 2 -z + 9/2, 
q 2 ( z ) ■= z 3 - z 2 /2 - 4z + 2. 

The auxiliary equation 

w 2 + qi (z)w + q 2 (z) =0 (39) 
with z replaced by e %e has the two distinct solutions 

f + (9) := e w -l/2, 
f_(9) : = e 2ie -4, 

for all 9 e [0, 2tt]. We deduce that 1/2 < |/+(0)| < 3/2 and 3 < \f-(0)\ < 5 for all 
9 G [0, 2tt]. The winding numbers of these curves around the origin are m + := 1 
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and m_ := 0. The solutions of q^z) = are ±2 and 1/2. We should therefore 
anticipate that for large n the equation ( l38i) has one solution near z = 1/2 and 
two distinct rings of solutions, both close to the unit circle. One of these rings has 
7i — l points on it while the other has n points. 

This example is particularly simple because one can factorize (1381) in closed form, 
the left hand side being the product of z n — z + 1/2 and z n — z 2 + 4. Because the 
discriminant of the quadratic equation (139~1) is a perfect square its roots come in 
pairs, so there must be an even number inside the unit circle. If one replaces the 
coefficient 9/2 of q\ by 9 one obtains a more typical example in which the roots 
form two distinct rings. The zeros of this modified polynomial p n are shown in 
Figure 3, for n : = 40. 
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-2 -1.5 -1 -0.5 0.5 1 1.5 2 

Figure 3. Eigenvalues of the polynomial p^o of Example |2"01 
Example 21. Consider the equation 

p n (z) := z 2n -Az n -8z + 3 = 0. (40) 

The solutions of the auxiliary equation 

w 2 -Aw- 8e ie + 3 = 

are 

w :=2±^8e l0 + l 

and combine into a single closed curve winding twice around the origin and crossing 
itself on the negative real axis. Figure 4 shows the set of zeros of the polynomial 
equation (H01 for n := 20. 
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(41) 
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Figure 4. Eigenvalues of the polynomial P20 of Example I2T1 
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